Search CORE

21 research outputs found

Evolutionary approaches for feature selection in biological data

Author: Dang Vinh Q.
Publication venue: Edith Cowan University, Research Online, Perth, Western Australia
Publication date: 01/01/2014
Field of study

Data mining techniques have been used widely in many areas such as business, science, engineering and medicine. The techniques allow a vast amount of data to be explored in order to extract useful information from the data. One of the foci in the health area is finding interesting biomarkers from biomedical data. Mass throughput data generated from microarrays and mass spectrometry from biological samples are high dimensional and is small in sample size. Examples include DNA microarray datasets with up to 500,000 genes and mass spectrometry data with 300,000 m/z values. While the availability of such datasets can aid in the development of techniques/drugs to improve diagnosis and treatment of diseases, a major challenge involves its analysis to extract useful and meaningful information. The aims of this project are: 1) to investigate and develop feature selection algorithms that incorporate various evolutionary strategies, 2) using the developed algorithms to find the “most relevant” biomarkers contained in biological datasets and 3) and evaluate the goodness of extracted feature subsets for relevance (examined in terms of existing biomedical domain knowledge and from classification accuracy obtained using different classifiers). The project aims to generate good predictive models for classifying diseased samples from control

Research Online @ ECU

Effectiveness of cervical pessary compared to cervical cerclage with or without vaginal progesterone for the prevention of preterm birth in women with twin pregnancies and a short cervix : Study protocol for a two-by-two factorial randomised clinical trial

Author: Bui Trung Q.
Dang Vinh Q.
Dang Vinh Q.
He Yen T.N.
He Yen T.N.
Le Cam H.
Le Thanh V.
Li Wentao
Mol Ben W.
Nguyen Diem T.N.
Nguyen Loc M.T.
Pham Ha N.H.
Trieu Tuyen T.T.
Vuong Lan N.
Vuong Nhu T.
Publication venue: 'BMJ'
Publication date: 01/06/2020
Field of study

Peer reviewedPublisher PD

Aberdeen University Research

Effectiveness and safety of in vitro maturation of oocytes versus in vitro fertilisation in women with high antral follicle count : Study protocol for a randomised controlled trial

Author: Dang Vinh Q.
Giang Nhu H.
Gilchrist Robert B.
Ho Tuong M.
Ho Vu N.A.
Le Anh H.
Mol Ben W.
Norman Rob J.
Pham Toan D.
Phung Tuan H.
Smitz Johan
Vuong Lan N.
Wang Rui
Publication venue: 'BMJ'
Publication date: 01/12/2018
Field of study

This work was supported by Ferring grant number 000323 and sponsored by My Duc Hospital.Peer reviewedPublisher PD

Aberdeen University Research

Incorporating Genetic Algorithm into Rough Feature Selection for High Dimensional Biomedical Data

Author: Dang Vinh Q.
Lam Chio- Peng
Lee Chang Su
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

In this paper, a hybrid approach incorporating genetic algorithm and rough set theory into Feature Selection is proposed for searching for the best subset of optimal features. The approach utilizes K-means clustering for partitioning attribute values, the rough set-based approach for reducing redundant data, and the genetic algorithm for searching for the best subset of features. A set of six attributes was obtained as the best subset using the proposed algorithm on the colon cancer dataset. Classification was carried out using this set of six attributes with 23 classifiers from WEKA (Waikato Environment for Knowledge Analysis) software to examine their significance to classify unseen test data. In addition, the set of 6 genes found by the proposed approach was also examined for their relevance to known biomarkers in the colon cancer domain

Crossref

Research Online @ ECU

NSC-GA: Search for optimal shrinkage thresholds for nearest shrunken centroid

Author: Dang Vinh Q
Lam Chiou-Peng P
Lee Chang Su
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

In this paper, a hybrid approach incorporating the Nearest Shrunken Centroid (NSC) and Genetic Algorithm (GA) is proposed to automatically search for an optimal range of shrinkage threshold values for the NSC to improve feature selection and classification accuracy for high dimensional data. The selection of a threshold value is crucial as it is the key factor in the NSC to find significant relative differences between the overall centroid and the class centroid. However, selecting this threshold value via \u27trial and error\u27 in empirical approaches can be time-consuming and imprecise. In the proposed NSC-GA approach, shrinkage threshold values for the NSC are encoded as genes in chromosomes that are evaluated using a fitness measure obtained from the classifier in the NSC. The proposed approach automatically searches for the optimal threshold for the NSC by utilizing GA. The proposed approach was evaluated using a number of data sets; Alzheimer\u27s disease, Colon and Leukemia cancer datasets. Experimental results indicated that the proposed approach finds the optimal range of shrinkage thresholds for each dataset, subsequently leading to a higher classification result and involving a smaller number of features when compared to previous studies

Crossref

Research Online @ ECU